All you need is a good init

نویسندگان

  • Dmytro Mishkin
  • Jiri Matas
چکیده

Layer-sequential unit-variance (LSUV) initialization – a simple method for weightinitialization for deep net learning – is proposed. The method consists of the twosteps. First, pre-initialize weights of each convolution or inner-product layer withorthonormal matrices. Second, proceed from the first to the final layer, normaliz-ing the variance of the output of each layer to be equal to one.Experiment with different activation functions (maxout, ReLU-family, tanh) showthat the proposed initialization leads to learning of very deep nets that (i) producesnetworks with test accuracy better or equal to standard methods and (ii) is at leastas fast as the complex schemes proposed specifically for very deep nets such asFitNets (Romero et al. (2015)) and Highway (Srivastava et al. (2015)).Performance is evaluated on GoogLeNet, CaffeNet, FitNets and Residual nets andthe state-of-the-art, or very close to it, is achieved on the MNIST, CIFAR-10/100and ImageNet datasets.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

All It Takes for Corruption in Health Systems to Triumph, Is Good People Who Do Nothing; Comment on “We Need to Talk About Corruption in Health Systems”

Numerous investigations demonstrate that the problem of corruption in the health sector is enormous and has grave negative consequences for patients. Nevertheless, the problem of corruption in health systems is far from eminent in the international health policy debate. Hutchinson, Balabanova, and McKee have identifed in their Editorial five reasons why the health policy community has been relu...

متن کامل

قانون طلایی تدارک حمایت از دانش آموزان با نیازهای ویژه در کلاسهای فراگیر: از دیگران آنطور حمایت کنید که دوست دارید از شما حمایت کنند

Consider for a moment that the school system paid someone to be with you supporting you 8 hours a day, 5 days a week. Now, imagine that you had no say over who that support person was or how she or he supported you. Or imagine that someone regularly stopped into your place of employment to provide you with one-on-one support. This person was present for all your interactions, escorted you to th...

متن کامل

Discourse Chunking: A Tool in Dialogue Act Tagging

Discourse chunking is a simple way to segment dialogues according to how dialogue participants raise topics and negotiate them. This paper explains a method for arranging dialogues into chunks, and also shows how discourse chunking can be used to improve performance for a dialogue act tagger that uses a case-based reasoning approach. 1 Dialogue act tagging A dialogue act (hereafter DA) is an en...

متن کامل

رژیم غذایی در بیمارانی که پیوند کلیه شده اند

Diet of patient who has tolerated kidney transplantation is differing from the past. Knowing about this subject, help patient choose proper diet. Patient may be has many questions about his/her diet that must be considered. Do you need to special diet? Yes, after kidney transplantation, diet playing important role. You understanding that attention to diet after transplantation are easier than d...

متن کامل

P25: Talent and Perseverance

Many people think that all you need to succeed at anything is talent but talent alone without perseverance and determination, cannot help you achieve success. Talent is helpful but perseverance ensured one achieves success. A child can show an exceptional talent for storytelling, but if he ignores his teacher’s comments and doesn’t work on his stories, he will never be a great novel...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1511.06422  شماره 

صفحات  -

تاریخ انتشار 2015